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ABSTRACT 


Error  patterns  have  been  measured  on  a  combined  troposcatter, 
microwave,  wireline  circuit  of  more  than  6000  circuit  miles.  The  error 
data  consist  of  more  than  6  x  10$  transmitted  bits  at  rates  of  1200  and 
2400  bits/sec.  These  error  patterns  are  presented  in  a  form  suitable 
for  the  analysis  of  error  control,  modem  improvement  or  channel 
modeling. 
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SECTION  I 
INTRODUCTION 


In  the  spring  of  1966,  data  transmission  tests  were  conducted  by  The 
MITRE  Corporation  on  a  U.  S.  Air  Force  data  circuit  which  will  be  designated 
as  OL-5/STC.  The  objective  of  these  tests  was  to  determine  the  characteris¬ 
tics  of  the  error  patterns  which  occur  on  this  link. 

The  link  comprises  three  types  of  transmission  media:  troposcatter, 

microwave,  and  wireline.  The  troposcatter  and  microwave  are  multiple-hop 

circuits  and  the  wireline  consists  of  numerous  interconnected  leased  telephone 

wirelines.  Measurements  made  on  other  parts  of  the  circuit  indicate  that  the 

troposcatter  dominates  in  terms  of  error  generation.  Rixon  Sebit  24B  vestigial 

sideband  AM  modems  were  used  at  data  rates  of  1200  and  2400  bits/sec  in  a 

3kc  channel.  The  data  collected  consisted  of  38  hours  at  1200  bits/  sec  and 

8  8 

61  hours  at  2400  bits/  sec  for  a  total  of  1.  67  x  10  and  5.  20  x  10  bits  respec¬ 
tively. 

The  error  pattern  data  were  obtained  by  one-way  transmission  of  a 
52 -bit  digital  data  test  pattern  which  was  compared  with  a  locally  generated 
test  pattern  at  the  receiving  end.  When  there  was  agreement  between  the  two, 
a  logical  zero  was  declared;  when  the  bits  were  different,  a  logical  one  was 
declared.  The  zeros  and  ones  were  recorded  on  magnetic  tape.  These 
magnetic  tapes  were  then  processed  through  a  tape-to-tape  converter  facility 
to  produce  an  IBM  compatible  tape  for  processing  in  an  IBM  7030  computer 
for  determination  of  the  statistics  of  these  bit-by-bit  error  patterns. 


1 


SECTION  II 

DESCRIPTIVE  STATISTICS 


The  data  for  the  two  transmission  rates  were  processed  for  four  types 
of  statistics:  the  occurrence  of  consecutive  errors,  the  occurrence  of 
intervals  between  errors,  the  occurrence  of  bursts,  and  word  error  rate 
information.  The  extent  of  the  data  analyzed  was  as  follows: 

Data  Rate  Total  Bits  Average  Error  Rate 

1200  b/s  1.  67  x  108  1.  39  x  10~4 

2400  b/s  5.  20  x  108  3.  95  x  10”4 

Figure  1  presents  the  relative  frequency  of  occurrence  of  consecutive  errors 
for  the  two  data  rates.  The  figure  indicates  that  the  relative  occurrence  of 
multiple  errors  is  not  a  function  of  the  data  rate  for  these  two  rates.  Further, 
the  relative  occurrence  is  certainly  not  of  an  independent  random  nature.  If 
it  were,  over  98  percent  of  the  errors  would  occur  as  single  errors,  while 
in  fact  only  70  percent  occur  as  single  errors.  This  lack  of  independence  is 
further  illustrated  in  Figure  2  where  the  cumulative  frequency  of  occurrence 
of  error-free  gaps  is  presented  for  the  actual  data  and  the  case  of  inde¬ 
pendent  errors,  respectively.  All  these  plots  confirm  that  this  error  pattern 
data  is  widely  different  from  independent  random  errors  at  the  same  error 
rate.  The  cumulative  distribution  of  error-free  gaps  for  independent  random 
errors  is  (in  theory)  described  by  the  equation 
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Figure  1 .  Frequency  of  Consecutive  Error  Occurrence 
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Figure  2.  Distribution  of  Gaps  Between  Errors 
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P  \  cne  |  e  (  =  P(l-P)k  (1) 

k  =  0 

where 

c  represents  a  correct  bit; 
e  represents  an  error  bit; 
n  represents  number  of  consecutive  bits;  and 
p  is  the  probability  of  bit  error. 

Figure  3  presents  the  message  error  rate  (defined  as  the  occurrence 
of  at  least  one  error  in  a  message)  as  a  function  of  message  size  for  these 
two  classes  of  data.  Along  with  these  curves,  the  corresponding  values  for 
independent  error  occurrence  are  presented.  These  values  are  derived 
from  the  relation 
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where 

p  =  probability  of  a  message  error; 
m 

p  ^  =  probability  of  bit  error;  and 

G 

m  =  message  length  in  bits. 


The  interesting  thing  to  note  in  these  curves  is  that  the  value  of 

p  for  m  >  2000  in  the  actual  data  is  independent  of  error  rate  and  signaling 
m 

speed.  This  indicates  that  the  errors  are  occurring  in  bursts,  and,  while 

for  the  overall  data  p  at  2400  bits/  sec  is  2.  85  times  as  large  as  p  at 

e  e 

1200  bits/  sec ,  the  additional  errors  which  are  occurring  at  the  higher 
signaling  speed  occur  as  increased  burst  density.  Thus  it  is  evident  that 
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Figure  3.  Message  Error  Rate  vs  Message  Size 


bursts  occur  as  fades  in  time  unrelated  to  the  data  rate  (for  the  rates  meas¬ 
ured).  This  conclusion  is  confirmed  by  a  visual  observation  of  the  error 
patterns. 
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SECTION  III 

DESCRIPTION  OF  BURST  STATISTICS 


The  description  of  the  error  statistics  presented  thus  far  has  made 
it  clear  that  the  errors  fall  in  a  non-random  clumped  distribution.  To 
describe  these  error  bursts  and  the  intervals  between  bursts  (guard  space), 
an  additional  statistic  has  been  developed. 

DEFINITION  OF  BURST 

A  burst  is  defined  as  a  region  of  the  serial  data  stream  where  the 

following  properties  hold.  A  minimum  number  of  errors,  M^,  is  contained 

in  the  region,  and  the  minimum  density  of  errors  in  the  region  is  A.  Both 

of  these  conditions  must  be  satisfied  for  the  chosen  values  of  M  and  A 

e 

for  the  region  to  be  defined  as  a  burst.  The  density  of  errors  is  defined 
as  the  ratio  of  bits  in  error  to  the  total  number  of  bits  in  the  region. 

The  following  properties  hold  for  the  burst.  The  burst  always  begins 
with  a  bit  in  error  and  ends  with  a  bit  in  error.  A  burst  may  contain  correct 
bits.  Each  burst  is  immediately  preceded  and  followed  by  an  interval  in 
which  the  density  of  errors  is  less  than  A. 

The  burst  probability  density  function  is  defined  as  the  probability  of 

occurrence  of  a  burst  of  size  N,  where  N  is  any  positive  integer.  The  burst 

size  is  measured  in  terms  of  the  total  number  of  bits  in  the  burst.  A  separate 

burst  probability  density  function  may  be  determined  for  each  pair  of  values 

of  A  and  M  . 

e 
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The  minimum  number  of  errors  in  a  burst  has  been  chosen  to  be  two 

for  all  the  data  included  here.  It  was  found  that  larger  values  of  M  would 

e 

not  change  the  values  of  mean  burst  length  significantly.  However,  the 
intervals  between  the  bursts  were  found  to  increase  drastically  so  that  little 
meaningful  data  could  be  obtained  for  the  burst  to  consecutive  interval  ratio. 
When  a  value  of  one  is  selected  for  M^,  every  error  becomes  a  burst,  and 
the  requirement  that  a  burst  begin  and  end  with  an  error  is  violated.  Con¬ 
sequently,  no  meaningful  data  is  obtained  for  this  value  of  M^. 

DEFINITION  OF  INTERVAL 

The  interval  is  defined  as  the  region  of  the  serial  data  stream  where 
the  following  properties  hold.  The  minimum  density  of  errors  is  less  than 
A,  and  the  region  begins  and  ends  in  a  correct  bit.  An  interval  may  contain 
errors.  An  interval  is  always  immediately  preceded  and  followed  by  a 
burst.  Thus,  each  and  every  bit  in  the  data  stream  must  lie  in  either  a 
burst  region  or  an  interval  region. 

The  interval  probability  density  function  is  defined  as  the  probability 
of  occurrence  of  an  interval  of  length  L,  where  L  is  any  positive  integer. 

The  interval  probability  density  is  a  joint  function  of  both  A  and  M^. 

DESCRIPTIONS  OF  DISTRIBUTIONS 

Figure  4  presents  the  distribution  of  observed  burst  lengths  for  the 
actual  data.  The  burst  lengths  range  from  two  (M  )  to  over  1,  000,  000  bits. 

G 

While  90  percent  are  less  than  1000  bits  long,  the  remaining  bursts  contain 
most  of  the  errors.  The  densities  of  these  bursts  are  displayed  in  Figure  5. 
This  figure  confirms  the  previous  conclusions  that  at  2400  bits/sec,  the 
additional  errors  serve  to  increase  the  density  of  the  bursts.  Further,  while 
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Figure  4.  Distribution  of  Lengths  of  Bursts 
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BURST  ERROR  DENSITY 


Figure  5 


Distribution  of  Burst  Error  Densities 


the  minimum  criteria  on  (A)  is  chosen  at  0.  01,  over  50  percent  of  the  bursts 
have  densities  greater  than  0. 1.  A  correlation  study  was  performed  on  the 
burst  lengths  and  their  associated  densities;  and  it  was  found  that,  while 
bursts  less  than  10  bits  in  length  are  generally  100  percent  dense  (consecutive 
errors)  and  bursts  over  100,  000  bits  in  length  are  near  the  minimum 
criteria,  the  other  bursts  range  in  density  between  the  minimum  and  75  per¬ 
cent,  indicating  no  correlation  between  the  length  of  bursts  and  densities. 
Figure  6  indicates  the  distribution  of  1200  b/s  data  in  this  study.  The  value 
of  A  was  chosen  at  0.  01  because  only  for  this  value  of  A  are  burst  length  and 
density  independent  of  A.  As  a  convenience  in  plotting  the  actual  and  theoret¬ 
ical  curves,  the  values  have  been  shifted  up  by  one  (since  there  is  no  zero  on 
a  log  scale). 

The  distribution  seen  on  these  curves  indicates  that  the  errors  fall 
into  adjacent  clusters  of  errors  with  long  error-free  intervals  between 
clusters.  There  are  no  intermediate  groupings  of  errors.  In  a  theoretical, 
random  error  distribution,  the  intermediate  groupings  would  predominate. 

Figures  7  and  8  show  the  distribution  of  interval  lengths  and  associated 
densities.  The  important  facts  to  note  here  are  that  over  60  percent  of  the 
intervals  are  error  free,  and  the  remaining  intervals  have  no  more  than  one 
or  two  random  errors  within  them. 

If  the  length  of  the  interval  directly  following  every  burst  is  divided 
by  the  burst  length,  a  quantity  known  as  the  guard  space  ratio,  which  is 
useful  in  coder  design,  is  obtained.  Figure  9  indicates  that  almost  5  per¬ 
cent  of  the  guard  space  ratios  of  this  data  link  are  insufficient  (<  1)  for 
direct  code  error  correction,  but  that  the  remaining  ratios  are  sufficient. 
Unfortunately,  this  5  percent  consists  of  the  longest  bursts. 
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Figure  6.  Relationship  Between  Burst  Length  and  Burst  Density 
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Figure  7.  Distribution  of  Lengths  of  Intervals 
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Figure  8.  Distribution  of  Interval  Error  Densities 
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Figure  9.  Guard  Space  Ratio  Distribution 


SECTION  IV 


CONCLUSIONS 

The  purpose  of  this  paper  is  to  present  the  error  patterns  which 
occur  on  a  circuit  predominated  by  a  troposcatter  path.  As  indicated, 
the  error  patterns  are  certainly  in  no  respect  random.  In  general  this 
channel  has  long,  high-density  bursts  followed  by  much  longer  error- 
free  intervals. 

The  data  implies  that  consideration  should  be  given  to  error  correc¬ 
tion  by  retransmission  and  hybrid  means.  The  presence  of  long  bursts 
tends  to  preclude  use  of  short  block  coding  unless  long  delay  interleavers 
are  used. 
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